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Abstract 

We provide sharp lower and upper bounds for the Gelfand widths of £ p -bal\s in the 
TV-dimensional i^-space for < p < 1 and p < q < 2. Such estimates are highly relevant 
to the novel theory of compressive sensing, and our proofs rely on methods from this area. 
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1 Introduction 

Gelfand widths are an important concept in classical and modern approximation and com- 
plexity theory They have found recent interest in the rapidly emerging field of compressive 
sensing [6| 114} [T7] because they give general performance bounds for sparse recovery methods. 
Since vectors in £ p -balls, < p < 1, can be well-approximated by sparse vectors, the Gelfand 
widths of such balls are particularly relevant in this context. In remarkable papers [26 } \21 \ [19] 
from the 1970s and 80s due to Kashin, Gluskin, and Garnaev, upper and lower estimates for 
the Gelfand widths of l\ -balls are provided. In his seminal paper introducing compressive 
sensing |17j . Donoho extends these estimates to the Gelfand widths of £ p -balls with p < 1. 
Unfortunately, his proof of the lower bound contains a gap. In this article, we address this 
issue by supplying a complete proof. To this end, we proceed in an entirely different way than 
Donoho. Indeed, we use compressive sensing methods to establish the lower bound in a more 
intuitive way. Our method is new even for the case p = 1. For completeness, we also give a 
proof of the upper bound based again on compressive sensing arguments. These arguments 
also provide the same sharp asymptotic behavior for the Gelfand widths of weak-£ p -balls. 

1.1 Main Result 

In this paper, we consider the finite-dimensional spaces £p , that is, R N endowed with the usual 
^ p -(quasi-)norm defined, for x E R , by 

N 



i/p 

, < p < oo, \\x oo := max \xA 

e=i,...,N 



For 1 < p < oo, this is a norm, while for < p < 1, it only satisfies the p-triangle inequality 

||z + y||S<IMIS+HC x,y£R N . (1.1) 
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Thus, || • ||p defines a quasi-norm with (optimal) quasi-norm constant C = max{l, 2 1 /p~ 1 }. As 
a reminder, || • ||x is called a quasi-norm on H N with quasi-norm constant C > 1 if it obeys 
the quasi-triangle inequality 



\x + y\\ x <C(\\x\\ x + \\y\\x), x,ye 



Other quasi-normed spaces considered in this paper are the spaces weak-c , , that is, R en- 
dowed with the i v oo-quasi-norm defined, for x G M. N , by 



\p,oo •" 



max 
=1,...JV 



£ 1/p \x* e \, 0<p<oo, 



on nt ^ Q Koiic := { x e || x || p < 1} and : = { x G R^ , ||x|| Pi00 < 1} of ^ 



where x* G R is a non- increasing rearrangement of x. We shall investigate the Gelfand widths 

pN 
<-q 

t 

We recall that the Gelfand width of order m of a subset K of R w in the (quasi-)normed 
space (R N , \\ ■ \\x) is defined as 



and ipoe for < p < 1 and p < q < 2. 



d m (K,X) : = inf sup 

AeR mxJV tinker A 

where kerA := G 1^,^ = 0} denotes the kernel of A It is well-known that the above 
infimum is actually realized [35J. Let us observe that d m (K,X) = for m> N when G K, 
so that we restrict our considerations to the case m < N in the sequel. Let us also observe 
that the simple inclusion B p C B pi x implies 

d m (B^£ N q )<d m (B^£ N q ). 

From this point on, we aim at finding a lower bound for d m (Bp ,£q) and an upper bound for 
d m (Bp OC ,£^) with the same asymptotic behaviors. Our main result is summarized below. 

Theorem 1.1. For < p < 1 and p < q < 2, there exist constants c p ^ q ,C Ptq > depending 
only on p and q such that, if m < N , then 

and, if p < I, 

CM mi . {i. !«i V"- 1 " <_ ^ O <_ c r „ min (i, !«i l*"*. (!,) 



In the case p = 1 and g = 2, the upper bound of (jl.2p with a slightly worse log-term was 
shown by Kashin in [26] by considering Kolmogorov widths, which are dual to Gelfand widths 
[291 135] . The lower bound and the optimal log-term for p = 1 and 1 < q < 2 were provided 
by Garnaev and Gluskin in [2H I19j . again via Kolmogorov widths. An alternative proof of 
the upper and lower estimates of (11. 2\ with p = 1 was given by Carl and Pajor in [10]. They 
did not pass to Kolmogorov widths, but rather used Carl's theorem [9] (see also [HIES]) that 
bounds in particular Gelfand numbers from below by entropy numbers, which are completely 
understood even for p, q < 1, see [38} 1251 127] . An upper bound for p < 1 and q = 2 was first 
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provided by Donoho [T7] with log(iV) instead of log (iV/m). With an adaptation of a method 
from |29| . Vybiral |39|, Lem. 4.11] also provided the upper bound of (|1.2p when < p < 1. In 
Section 3, we use compressive sensing techniques to give an alternative proof that provides the 
upper bound of (II. 3p , 

Donoho's attempt to prove the lower bound of (jl.2p for the case < p < 1 and q = 2 
consists in applying Carl's theorem and then using known estimates for entropy numbers, 
similarly to the approach by Carl and Pajor for p = 1. However, it is unknown whether Carl's 
theorem extends to quasi- norm balls, in particular to £ p -balls with p < 1. The standard proof 
of Carl's theorem for Gelfand widths [111 [29] uses duality arguments, which are not available 
for quasi-Banach spaces. We believe that Carl's theorem actually fails for Gelfand widths of 
general quasi-norm balls, although it turns out to be a posteriori true in our specific situation 
due to Theorem 11.11 

We shortly comment on the case q > 2. Since then \\v\\ q < ||i>||2 f° r ah v 6 M. N , we have 
the upper estimate 

< < C P , 2 min|l, ln(iV/ ^ ) + 1 } 1/P ' 1/2 . (1.4) 

The lower bound in (jl.2p extends to q > 2, but is unlikely to be optimal in this case. It seems 
rather that (ll.4p is close to the correct behavior. At least for p = 1 and q > 2, [20J gives lower 
estimates of related Kolmogorov widths which then leads to (see also [39J ) 

The latter matches (jl.4j) up to the log-factor. We expect a similar behavior for p < 1, but this 
fact remains to be proven. 



1.2 Relation to Compressive Sensing 

Let us now outline the connection to compressive sensing. This emerging theory explores the 
recovery of vectors x £ M N from incomplete linear information y = Ax G R m , where A 6 M. mxN 
and m < N. Without additional information, reconstruction of x from y is clearly impossible 
since, even in the full rank case, the system y = Ax has infinitely many solutions. Compressive 
sensing makes the additional assumption that x is sparse or at least compressible. A vector 
x G R N is called s-sparse if at most s of its coordinates are non-zero. The error of best s-term 
approximation is defined as 

a s (x) p := inf{||x — z\\ p ,z is s-sparse}. 

Informally, a vector x is called compressible if cr s (x) p decays quickly in s. It is classical to show 
that, for q > p, 

x\\ p , (1.5) 
x\\ P ,oo, D PtQ := (q/p - I)" 1 /" . (1.6) 

This implies that the balls Bp and B p oo with p < 1 serve as good models for compressible 
signals: the smaller p, the better x in Bp or in Bp >oa is approximable in by s-sparse vectors. 
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s l/p-l/q 
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The aim of compressive sensing is to find good pairs of linear measurement maps A £ M. mxN 
and (non-linear) reconstruction maps A : R m — >• R N that recover compressible vectors x with 
small errors x — A(Ax). In order to measure the performance of a pair (A, A), one defines, for 
a subset K of M. N and a (quasi-) norm || • \\x on R N , 



E m (K,X):= inf sup \\x - A(Ax) 



AeM mxJV ,A:R m ^M JV x&K 



Quantities of this type play a crucial role in the modern field of information based complexity 
|34j . In our situation, only linear information is allowed in order to recover K uniformly. 
The quantities E m (K,X) are closely linked to the Gelfand widths, as stated in the following 
proposition [TTJ Q3], see also 155]. 

Proposition 1.2. Let K C R N be such that K = -K and K + K C CiK for some C\ > 2, 
and let \\ ■ \\x be a quasi-norm on M. N with quasi-norm constant C2. Note that C\ = 2 if K is 
a norm ball and that C2 = 1 if \\ ■ \\x is a norm. Then 

C 2 l d m {K,X) < E m (K,X) < dd m {K,X). 

Combining the previous proposition with Theorem 11.11 gives optimal performance bounds 
for recovery of compressible vectors in Bp , < p < 1, when the error is measured in £ q , 
p < q < 2. Typically, the most interesting case is q = 2, for which we end up with 

f ln(iV/m) + 11 1/p - 1/2 „ fnN nN , „ f ]n(N/m) + I) 1/p ~ 1/2 
c P mm|l,^-J 1 <EUB?,£Z)<C p wm{l,-±-LJ 1 

For practical purposes, it is of course desirable to find matrices A G M. mxN and efficiently 
implementable reconstruction maps A that realize the optimal bound above. For instance, 
Gaussian random matrices A £ W mxN , i.e., matrices whose entries are independent copies 
of a zero-mean Gaussian variable, provide optimal measurement maps with high probability 
[SI [13 II]- An optimal reconstruction map is obtained via basis pursuit [131 El [H], i.e., via the 
^-minimization mapping given by 

Ai(y) := argmin \\z\\i subject to Az = y. 

This mapping can be computed with efficient convex optimization methods [2], and works very 
well in practice. The proof of the lower bound in (|1.2[) will further involve £ p -minimization for 
< p < 1 via the mapping 

A p (y) := argmin ||z|| p subject to Az = y. 

A key concept in the analysis of sparse recovery via £ p -minimization is the restricted isometry 
property (RIP). This well-established concept in compressive sensing [HI [7] is the main tool 
for the proof of the upper bound in (jl.3p . We recall that the sth order restricted isometry 
constant S S (A) of a matrix A £ M. mxN is defined as the smallest 5 > such that 

(1 - S)\\x\\l < \\Ax\\l < (1 + S)\\x\\l for all s-sparse x £ R N . 

Small restricted isometry constants imply stable recovery by £i-minimization, as well as by 
^-minimization for < p < 1. For later reference, we state the following result [71 15} US]. 
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Theorem 1.3. Let < p < 1. If A £ M. mxN has a restricted isometry constant o~2 S < v2 — 1, 
f/ien, /or aZZ cc G M. , 

\\x-\{Ax)\\l<Ca s {xY p , (1.7) 

where C > is a constant that depends only on 82s- In particular, the reconstruction of s-sparse 
vectors is exact. 

Given a prescribed < 8 < 1, it is known [H [HJ [31] that, if the entries of the matrix A 
are independent copies of a zero-mean Gaussian variable with variance 1/m, then there exist 
constants Ci, C2 > (depending only on 6) such that <5 S (A) < 8 holds with probability greater 
than 1 — e~ C2m provided that 

m>ds\n(eN/s). (1.8) 

In particular, there exists a matrix A £ R mxAr such that the pair (.A, Ai), and more generally 
(A, A p ) for < p < 1, allows stable recovery in the sense of (|1.7p as soon as the number of 
measurements satisfies (|1.8|) . Vice versa, we will see in Theorem 12.71 that the existence of any 
pair (A, A) allowing such a stable recovery forces the number of measurements to satisfy (|1 .8j) . 

Lemma 12.41 which is of independent interest, estimates the minimal number of measure- 
ments for the pair (A, A p ) to allow exact (but not necessarily stable) recovery of sparse vectors. 
Namely, we must have 

m > cipsln(N/(c 2 s)) (1.9) 

for some explicitly given constants c\,C2 > 0. In the case p = 1, this result can be also 
obtained as a consequence of a corresponding lower bound on neighborliness of centrosymmetric 
polytopes, see [TU [28]. Decreasing p while keeping N fixed shows that the bound f)l .9j) becomes 
in fact irrelevant for small p, since the bound m > 2s holds as soon as there exists a pair (A, A) 
allowing exact recovery of all s-sparse vectors, see [141 Lem. 3.1]. Combining the two bounds, 
we see that s-sparse recovery by ^-minimization forces 

m > Cis(l+pln(N/(C 2 s)j), 

for some constants C\,C 2 > 0. Interestingly, if such an inequality is fulfilled (with possibly 
different constants C\, C2) and if A is a Gaussian random matrix, then the pair (A, A p ) allows 
s-sparse recovery with high probability, see [12] . We note, however, that exact ^-minimization 
with p < 1, as a non-convex optimization program, encounters significant difficulties of imple- 
mentation. For more information on compressive sensing, we refer to [4"1 [6l 151 [14" 1 117 1 157] . 
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2 Lower Bounds 

In this section, we use compressive sensing methods to establish the lower bound in (|1.2p . hence 
the lower bound in (jl.3p as a by-product. Precisely, we show the following result, in which the 
restriction q < 2 is not imposed. 
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Proposition 2.1. For < p < 1 and p < q < oo, there exists a constant c Pj(? > such that 

™, \r m, f ln(eN/m) ) 1 /p- 1 /i 

d m (B^,e^)>c M mm{l, l J ; | , m<iV. (2.1) 

The proof of Proposition 12. II involves several auxiliary steps. We start with a result |23[ 124] 
on the unique recovery of sparse vectors via ^-minimization for < p < 1 . A proof is included 
for the reader's convenience. We point out that, given a subset S of [N] := {1, ■■■,N} and a 
vector v G R , we denote by the vector that coincides with u on S and that vanishes on 
the complementary set S c := [N] \ S. 

Lemma 2.2. Let < p < 1 and N, m,s G N mi/i m,s < N. For a matrix A G R mxAr , i/ie 
following statements are equivalent. 

(a) Every s-sparse vector x is the unique minimizer of \\z\\ p subject to Az = Ax, 

(b) A satisfies the p-null space property of order s, i.e., for every v G ker A \ {0} and every 
S C [N] with \S\<s, 

\\vs\\ p P < 2 II^IIp ■ 

Proof: (a) => (6): Let u G ker A \ {0} and S C [iV] with |5| < s. Since v = vs + vs? satisfies 
Av = 0, we have Avs = A(—vs<:)- Then, since vs is s-sparse, (a) implies 

\\ v s\\p < II - v s 4p = \\ v s4p- 

Adding \\vs\\p on both sides and using ||«sc||^ + \\vs\\ p = \\v\\ p gives (b). 

(b) (a): Let x be an s-sparse vector and let S := suppx. Let further z / x be such that 
= Ar. Then v := x — z G ker A \ {0} and 

IM|£ < Harsr - z s \\l + ll^ll^ = \\vs\\ p p + |ks||? , (2-2) 

where the first estimate is a consequence of the p-triangle inequality ([Lip . Clearly, (b) implies 
ll^sllp < Iks' 1 lip- Plugging this into (12.2j) and using that = — z^c gives 

\\x\\ P P < \\ v s4p + \\ z s\\p = \\zs4p + Iksll? = \\4p- 
This proves that x is the unique minimizer of ||-z|| p subject to Az = Ax. ■ 

The next auxiliary step is a well-known combinatorial lemma, see for instance \32\ [31 I22| . 
[301 Lem. 3.6]. A proof that provides explicit constants is again included for the reader's 
convenience. 

Lemma 2.3. Let N, s G N with s < N . There exists a family IA of subsets of [N] such that 

(i) Every set in IA consists of exactly s elements. 

(ii) For all I,JeU with I 7^ J, it holds \I fl J\ < s/2. 
(Hi) The family U is "large" in the sense that 
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Proof: We may assume that s < N/4, for otherwise we can take a family IA consisting of just 
one element. Let us denote by B(N, s) the family of subsets of [N] having exactly s elements. 
This family has size \B(N,s)\ = ( s ). We draw an arbitrary element I± G B(N,s) and collect 
in a family Ai all the sets J G B(N, s) such that \I f] J\ > s/2 . Then A\ has size at most 

the latter inequality holding because [s/2\ < (N — s)/2 when s < N/2. We throw away 
A\ and observe that every element in J € B(N,s) \ Ai satisfies \I\ n J\ < s/2. Next we 
draw an arbitrary element I2 G B(N,s) \ A\, provided that the latter is not empty. We 
repeat the procedure, i.e., we define a family A2 relative to I2 and draw an arbitrary element 
^3 G B(N, s) \ (Ai U A2), and so forth until no more elements are left. The size of each set Ai 
can always be estimated from above by (12.3p . This results in a collection U = {1%, . . . of 
subsets of [N] satisfying (i) and (ii). We finally observe that 



( N s ) _ 1 N(N-l)-(N-s + l) 1 



L > 



2-(J-') 2- (N - s)(N — s — 1) • • • (N — s — [s/2\ + 1) s(s - 1) • • ■ (Ls/2j + 1) 



1 N(N-1)---(N- \s/2] +1) 1 /JV\r«/al /JVy/2 
~ 2~ s s(s -!)■■■ (s- [s/2] +1) ~ 2 1 V"7/ _ v4s/ 

This concludes the proof by establishing (iii) . ■ 

We now use Lemma 12.31 for the final auxiliary result, which is quite interesting on its own. It 
gives an estimate of the minimal number of measurements for exact recovery of sparse vectors 
via £ p - minimization, where < p < 1. 

Lemma 2.4. Let < p < 1 and N,m,s G N uni/i m < N and s < N/2. If A G M mxAr is a 
matrix such that every 2s-sparse vector x is a minimizer of \\z\\ p subject to Az = Ax, then 

, / # 

m > cips In I 

Vc 2 s 

where c\ := 1/ In 9 ~ 0.455 and C2 := 4 . 

Remark 2.5. Lemma 2.6 could be rephrased (with modified constants) by replacing 2s-sparse 
vectors, s > 1, by s-sparse vectors, s > 2. In the case 8 = 1, it is possible for every 1-sparse 
vector x to be a (nonunique) minimizer of \\z\\\ subject to Az = Ax, yet m > cipln(A^/c2) fails 
for all constants c\,c% > 0. This can be verified by taking m = 1 and A = [l 1 • • • 1 l] . 

Proof: We consider the quotient space 

X := R N /ker A = {[x] := x + ker A, x G R N }, 
which has algebraic dimension r := rank A < m. It is a quasi-Banach space equipped with 

\\[x]\\ A ,p ■= jnf ||x + u|| P . 



Indeed, a simple computation reveals that || • \\a, p satisfies the p-triangle inequality, i.e., 

\M + [y}\\X P <M\\ p A, p + \\[y}\\ P A, p - 
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By assumption, the quotient map [•] preserves the norm of every 2s-sparse vector. We now 
choose a family U of subsets of [N] satisfying (i), (ii), (iii) of Lemma HOI For a set I G U, we 
define an element xj € & with = 1 by 



■iei 



where (ei, . . . , ejv) denotes the canonical basis of K . For I,JeU, I 7^ J, (ii) yields 

2s-2s/2 n 
x/ - Xj £ > = 1 . 

Since the vector x/ — xj is a 2s-sparse vector, we obtain 

||[x/] - [xj]||a, p = \\[xi - xj]\\a, p = \\xi - xj\\ p > 1 . 

The p-triangle inequality implies that {[xj] + (l/2) 1 ' p Bx, I € U} is a disjoint collection of 
balls included in the ball (3/2) 1 / p Bx, where Bx denotes the unit ball of (X, \\ ■ \\a,p)- Let vol(-) 
denote a volume form on X, that is a translation invariant measure satisfying vol(-Bx) > 
and vol(A-Bx) = X r vol(XBx) for all A > (such a measure exists since X is isomorphic to R r ). 
The volumes satisfy the relation 

£>ol([x r ] + (l/2) 1 /f b x ) < vol((3/2) 1 /P 5x ). 
ieU 

By translation invariance and homogeneity, we then derive 

\U\ (l/2)^vol(B x ) < (3/2) r / p vol{B x ) . 
As a result of (iii), we finally obtain 

— ) < 3 r/p < 3 m/p . 



Taking the logarithm on both sides gives the desired result. ■ 

Now we are ready to prove Proposition 12.11 . The underlying idea is that a small Gelfand 
width would imply 2s-sparse recovery for s large enough to violate the conclusion of Lemma [2.4i 

Proof: With c := (l/2) 2 / p ~ l / q and d := 2cip/(4 + ci) ~ 0.204p, we are going to prove that 

d m «,^) > c^- 1 ^, where p : = min{l, ^^ j. (2.5) 

The desired result will follow with c Pt q := cd l / p ~ 1 i q . By way of contradiction, we assume that 
d m {B^,i^) < cn 1/p ~ 1/q . This implies the existence of a matrix A £ R mxN such that, for all 
v € ker A \ {0}, 

H, < c^l p - l l q \\v\\ p . 

For a fixed v G ker ^4\{0}, in view of the inequalities \\v\\ p < A rl / p ~ 1 / <? ||?;|| l 2 and c < (l/2) 1 / p ~ 1 / q , 
we derive 1 < (ixN /2) 1 l p - 1 / q , so that 1 < < iV/2. We then define s := |1/mJ > 1) so that 

1 1 

— < s < -. 
2// ji 
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Now, for v E ker A \ {0} and S C [N] with |5| < 2s, we have 

IMI„ < (2s) 1 ^- 1 ^\\v s \\ q < (2*) 1/p - 1/5 |MI, < c(2s^-^\\v\\ p < ^\\v\\ P - 

This shows that the p-null space property of order 2s is satisfied. Hence, Lemmas 12.21 and 12.41 
imply 

/ N \ 

m > cipsln . (2-6) 

\c 2 s/ 

Besides, since the pair (A, A p ) allows exact recovery of all 2s-sparse vectors, we have 

m>2(2s)=c 2 s. (2.7) 

Using (1221) in ([22]), it follows that 

N\ , feN\ cip , feN\ c\ 

I — r-i ns ~~> n » 



m > cips In — = cips In — c\ps > In -m. 

•.mJ v m J 2\x \ m J 4 



After rearrangement, we deduce 

2c\p ln( eN/m) 2c\p \n(eN/m) 

TTl ^> ^ Tfl = TYl. 

4 + ci min {l, d In (eN/m)/m} ~ 4 + c\ dln(eN/m) 
This is the desired contradiction. ■ 

Remark 2.6. When m is close to N, the lower estimate (|2.5p is rather poor. In this case, a 
nice and simple argument proposed to us by Vybiral gives the improved estimate 



a™(B^t«)>{-—) , m<N. 



Indeed, for an arbitrary matrix A G M. mxN , the kernel of A and the (m + 1)- dimensional space 
{x G M. N : Xi = for all i > m + 1} have a nontrivial intersection. We then choose a vector 
v ^ in this intersection, and (|2.8p follows from the inequality \\v\\ p < (m + l) 1 / p ~ 1 /' 2 ||f \\ q . 

We close this section with the important observation that any measurement/reconstruction 
scheme that provides ^-stability requires a number of measurements scaling at least like the 
sparsity times a log-term. This may be viewed as a consequence of Propositions 11.21 and 12.11 
Indeed, fixing p < 1, the inequalities (|1.7p and (|1.5p imply 

d m (B^,l?) < E m (B?,£?) < C sup <7 a (s)! < 

x€B* S 1 /P 1 

The lower bound (|2. 1 j) for the Gelfand width then yields, for some constant c, 

ln(eN/m) 



c min < 1, 



m 
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We derive either s < 1/c or m > cs\a{eN /m). In short, if s > 1/c, then £i-stability implies 
m > cs ln(eN/m) — which can be shown to imply in turn m > c's ln(eN/s). We provide below 
a direct argument that removes the restriction s > 1/c. It uses Lemma 12.31 and works also for 
^p-stability with p < 1. It borrows ideas from a paper by Do Ba et al. \15\ Thm. 3.1], which 
contains the case p = 1 in a stronger non-uniform version. 
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Theorem 2.7. Let N,m,s 6 N m,s < N. Suppose that a measurement matrix A 6 M. mxN 
and a reconstruction map A : M. N — > R m are stable in the sense that, for all x E R , 

||x-A(vlx)||P<C7a s (x)P 

/or some constant C > and some < p < 1. T/ien i/iere exists a constant C > depending 
only on C such that 

m > C ps ln(eiV/s). 

Proof: We consider again a family of subsets of [N] given by Lemma 12.31 For each I €U, 
we define an s-sparse vector xj with ||a;j||p = 1 as in (|2.4p . With p := (2(C + l))~ 1//p , we claim 
that + pB*?),I G is a disjoint collection of subsets of A(M. ), which has algebraic 

dimension r < m. Suppose indeed that there exist I,JeU with I ^ J and z, z' € pB p such 
that A{xj + z) = A{xj + z'). A contradiction follows from 

1 < \\ XI - xj\\p < \\ XI + z - A(A(xj + z)W p + \\xj + z'- A(A(xj + z')W p + \\z\\p + ||/||£ 

< Ca s (xj + z)l + Ca s (xj + z% + \z% + \\z'f v 

< C\\z\\ p p + C\\z'\\ p + ||^||P + \\z'\\l < 2(C + 1)/ = 1. 



We now observe that the collection {A(xi + pB p ),L £ hi} is contained in (1 + p p ) 1 ^ p A(B, 
As in the proof of Lemma 12.44 we use a standard volumetric argument to derive 



N ) 



\U\p r vol{A(B?)) = -£vol[A( Xl + pB»)) < vol((l + /) 1 /PA(B p v )) = (l + /)^vol(^«)). 
leu 



—Y /2 < (p~ p + l) r/p < {p~ p + l) m/p = (2C + 3) m/p . 



We deduce that 



Taking the logarithm on both sides yields 

m > cpsln(iV/(4s)), with c := l/(21n(2C + 3)). 
Finally, noticing that m >2s because the pair (A, A) allows exact s-sparse recovery, we obtain 

cln(4e) 

m > cpsln(eN/s) — cps ln(4e) > cpsln(eiV/s) m. 

The desired result follows with C := (2c)/(2 + cln(4e)). ■ 



3 Upper Bounds 

In this section, we establish the upper bound in (|1.3|) . hence the upper bound in (jl.2j) as 
a by-product. As already mentioned in the introduction, the bound for the Gelfand width 
of £ p -balls was already provided by Vybiral in |39j . but the bound for the Gelfand width of 
weak-^p-balls is indeed new. 

Proposition 3.1. For < p < 1 and p < q < 2, there exists a constant C p ^ q > such that 

^(B^SC^l!,^^} 1 ''-''' , m<N. (3.1) 
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The argument relies again on compressive sensing methods. According to Proposition 11.2^ 
it is enough to establish the upper bound for the quantity E m (Bp OC ,£^). This is done in the 
following theorem, which we find rather illustrative because it shows that, even when p < 1, an 
optimal reconstruction map A for the realization of the number E m (B p>00 ,£ q ) can be chosen 
to be the ^-minimization mapping, at least when q > 1. The argument is originally due to 
Donoho for q = 2 \17\ Proof of Theorem 9] and can be extended to all 2 > q > p. 

Theorem 3.2. For < p < 1 and p < q < 2, there exists a matrix A E R mxN such that, with 
r = min{l, q}, 

\n(N/m) 



sup ||x — A r (Ac)||q < C Pt q min < 1, 

x&Bff„ ' ' 1 "' 



where C Ptq > is a constant that depends only on p and q. 

Proof: Let C\ be the constant in (II. 8p relative to the RIP associated with 5 = 1/3, say. We 
choose a constant D > large enough to have 

D/2>e - i + mp/2) >cv 

We are going to prove that, for any x £ B^^, 

a / a \ 1 1 f Dln(eN/m)} Vp-V? , . 

\\x-A r (Ax)\\ q <C' P)q mm[l, ^ ; | (3.2) 

for some constant C' p q > 0. This will imply the desired result with C Ptq := C' p q D l l p ~ 1 / q . 
Case 1: m> Dln(eN/m). 

We define s > 1 as the largest integer smaller than m/{D ln(eiV/m)), so that 

< s < KTZT^TJZX - (3-3) 



2D\n{eN/m) ~ Dln(eN/m) 

Putting t = 2s and noticing that t/m < 2/D < 1/e and that u i— > u\a{u) is decreasing on 
[0, 1/e], we obtain 

m > —thx{eN/m) = —tln(eN/t) + — m {t/m) ln(i/m) > —tln(eN/t) - m \n(D/2), 
so that 

m > 1 + i n [ 2 D/2) tln (eN/t) > dt\n{eN/t). 

It is then possible to find a matrix A G M. mxN with St(A) < 5. In particular, we have 
o~s(A) < (5. Now, given v := x — A r (Ax) £ ker^4, we decompose [N] as the disjoint union of 
sets Si, S2, S3, . . . of size s (except maybe the last one) in such a way that \vi\ > \vj\ for all 

i e S k -i, j e S k , and k>2. This easily implies (||fs fe |||/s) 1/2 < llr/ s ) > i- e -> 

\\v Sk h < i/ r -i/ 2 H^fc-i llr. ^^2- (3-4) 
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Using the r-triangle inequality, we have 

E^II^E i wis * E (* 1/9 - 1/2 iKii 2 ) r < E ( £ ^ l|j4 ^ 112 
fe>i 9 k>i fc>i fc>i ^ 



Mi, 



The fact that u S ker ^4 implies -Aus^ = — ^ k>2 Avs k - It follows that 

V fc>2 V fc>2 

1/5-1/2. r , [Y^J 

fc>2 fc>2 



We then derive, using the inequality (|3.4p . 



fe>i 



In view of the choice (5 = 1/3 and of (j3.3j) , we deduce 



\x - A r (Ax)\\ q < 2^V2( 2DHeN/m Y r - 1/q \\* ~ MAx)\\ r . (3.5) 



m 



Moreover, in view of < 1/3 and of Theorem 1 1.3^ there exists a constant C > such that 

\\x - A r (Ax)\\ r < C 1/r a s (x) r . (3.6) 
Finally, using (|1.6p and (|3.3p . we have 

^(^)r< ^Tpi^ < Av( J ■ (3.7) 

Putting ()3.5p . (|3.6fl . and (13. 7p together, we obtain, for any x 6 Bp j00 , 

||* - A r (^)|| 9 < C;, g (^ ln ^) 1/P " 1/9 = C^min {l, 

where := C^i^/H-i/a+l/p-l/^ 
Case m < Dln(eN/m). 

We simply choose the matrix A £ R mx ^ as A = 0. Then, for any x G B^^, we have 



|x A r (j4*) ||q — H^Hq 5; ^p,gll a 'llpiOO — ^33,5' 



for some constant C p '' 9 > 0. This yields 

||,-A. r (A,)ll,<^min{l, P '" (eiV/ "" } 1/ '-^ 



??? 



Both cases show that (|3.2p is valid with C' := max{C'' 9 , Cp' 9 }. This completes the proof. 



Remark 3.3. The case p = 1, for which r = 1, is not covered by our arguments. Since 
sup xeB N (Js(x)i x log(A r /s) the quantity cr s (x)i cannot be bounded by a constant times \\x\\i )00 
in order to obtain f|3.Tj) . Instead, the additional log-factor log(iV / m) appears on the right-hand 
side and therefore in the upper estimate of (II. 3p in the case p = 1. The correct behavior 
of the Gelfand widths of weak-t\-balls does not seem to be known. Nonetheless, the inequality 
o~ s (x)i < ll^lli * s always true. This yields the well-known upper estimate for the Gelfand widths 
of i\-balls and hence completes the proof of Theorem \l.l\ 
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